- Me
- Lessons Learned
- Containers
- Kubernetes
- Containers for Data Scientists
- Q/A
- Advertisement
2019-11-25
Education - UIC
Bachelor’s - Philosophy/Mathematics
Master’s - Game Theory/OR
PhD - Statistics/Machine Learning
Postdoc - Tsinghua University, Beijing
Professional
Civis Analytics - Data Scientist
Vivaki - Data Scientist
Sears Holdings - Data Scientist
IBM - Open Source Analytics Technical Evangelist
IBM - Executive Architect
Personal
Cubs, MtG, Music, Politics
“Virtual machines are software computers that provide the same functionality as physical computers. Like physical computers, they run applications and an operating system. However, virtual machines are computer files that run on a physical computer and behave like a physical computer. In other words, virtual machines behave as separate computer systems.”
https://www.vmware.com/topics/glossary/content/virtual-machine
For containers, those computer files are called images.
Containers are instantiations of an image.
https://www.electronicdesign.com/dev-tools/what-s-difference-between-containers-and-virtual-machines
“It’s an application-centric way to deliver high-performing, scalable applications on the infrastructure of your choosing.”
k8s
“Kubernetes is an open-source container orchestration system for automating application deployment, scaling, and management”
Without Containers
Have:
Don’t Have:
Without Containers
> library("ggplot2")
Warning message:
package ‘ggplot2’ was built under R version 3.4.4
With Containers
Everything is packaged up into an immutable image.
As long as you can run the container, you can reproduce the research!
apiVersion: v1
kind: Pod
metadata:
  name: frontend
spec:
  containers:
  - name: db
    image: mysql
    env:
    - name: MYSQL_ROOT_PASSWORD
      value: "password"
    resources:
      requests:
        memory: "64Mi"
        cpu: "250m"
      limits:
        memory: "128Mi"
        cpu: "500m"
  - name: wp
    image: wordpress
    resources:
      requests:
        memory: "64G"
        cpu: "16"
      limits:
        memory: "128G"
        cpu: "16"
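The quantities in the Pod spec use Kubernetes suffix notation: "250m" is 250 millicores (a quarter of a CPU), "64Mi" is 64 × 2^20 bytes, and "64G" is 64 × 10^9 bytes. A minimal Python sketch of how those strings turn into numbers, and of the per-pod request totals a scheduler would roughly need to fit on a node, might look like this. The helper names `parse_quantity` and `total` and the `pod_resources` dictionary are made up for illustration and cover only a subset of the suffixes; this is not the Kubernetes implementation.

```python
from fractions import Fraction

# Multipliers for a subset of Kubernetes quantity suffixes (illustrative only).
SUFFIXES = {
    "Ki": 2**10, "Mi": 2**20, "Gi": 2**30, "Ti": 2**40,  # binary
    "k": 10**3, "M": 10**6, "G": 10**9, "T": 10**12,     # decimal
    "m": Fraction(1, 1000),                               # milli, e.g. millicores
}

def parse_quantity(text):
    """Convert a quantity such as "250m", "64Mi" or "16" to an exact number."""
    for suffix, factor in SUFFIXES.items():
        if text.endswith(suffix):
            return Fraction(text[: -len(suffix)]) * factor
    return Fraction(text)  # no suffix: plain number

# The requests and limits from the two containers in the Pod spec above.
pod_resources = {
    "db": {"requests": {"memory": "64Mi", "cpu": "250m"},
           "limits": {"memory": "128Mi", "cpu": "500m"}},
    "wp": {"requests": {"memory": "64G", "cpu": "16"},
           "limits": {"memory": "128G", "cpu": "16"}},
}

def total(kind, resource):
    """Sum one resource ("cpu" or "memory") over all containers in the pod."""
    return sum(parse_quantity(c[kind][resource]) for c in pod_resources.values())

if __name__ == "__main__":
    print("CPU requested (cores):", float(total("requests", "cpu")))
    print("Memory requested (bytes):", int(total("requests", "memory")))
```

Exact fractions are used so that millicore values such as "250m" do not pick up floating-point error when summed.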
Alternative:
“Terraform is an open-source infrastructure as code software tool created by HashiCorp. It enables users to define and provision a datacenter infrastructure using a high-level configuration language known as Hashicorp Configuration Language, or optionally JSON.”
R code as an API
With Shiny on Kubernetes, your app can scale!
https://thecooperreview.com/10-tricks-appear-smart-meetings/
Questions?
Cloud Pak for Multicloud Management
Cloud Automation Manager
built on open source:
Ansible Tower
built on open source: